Order Acceptance under Uncertainty: a Reinforcement Learning Approach

نویسندگان

  • Peter Schuur
  • Martijn van Otterlo
  • Marko Snoek
چکیده

Beta is the largest research centre in the Netherlands in the field of operations management in technology-intensive environments. The mission of Beta is to carry out fundamental and applied research on the analysis, design and control of operational processes. PROEFSCHRIFT ter verkrijging van de graad van doctor aan de Universiteit Twente, op gezag van de rector magnificus, prof.dr. Acknowledgements Many people have contributed to this thesis in different ways. I thank Aart van Harten and Peter Schuur very much for giving me the posibility to work in this project, for the nice discussions we had on several topics that helped me to finish this thesis. Special thanks go to the team from computer science: Martijn van Otterlo, Marko Snoek and Mannes Pool; for introducing me on the field of Reinforcement Learning, for the many discussions we had of all kind and for the yearly team-lunch we had. Thanks to the BETA school for organizing the PhD activitiess. Thanks to Mark Ebben, Erwin Hans and Floris Olde Weghuis for the insights in order acceptance and for facilitating me their integrated/generic simulation environment. Thanks to Jorge Luis for his help with the implementation of the ANNs to work with in eM-Plant. Thanks to Prof. for participating in my graduation committee. This thesis has taken many years to come to an end, and I could not have gone through this period without the support of all the many wonderful people I have met here and there, at both sides of the ocean. From one side. First, my Dutch non-dutch family: Adriana and Andrei. From the other side. First my parents and brother, who have always been with me in a way, for all their support and their lifes as models of eternal study.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Deep Reinforcement Learning Solutions for Energy Microgrids Management

This paper addresses the problem of efficiently operating the storage devices in an electricity microgrid featuring photovoltaic (PV) panels with both shortand long-term storage capacities. The problem of optimally activating the storage devices is formulated as a sequential decision making problem under uncertainty where, at every time-step, the uncertainty comes from the lack of knowledge abo...

متن کامل

Neuro-Genetic Order Acceptance in a Job Shop setting

In this paper a new neuro-genetic architecture is presented that solves a profit oriented dynamic job shop problem. In the job shop order acceptance and scheduling problem new jobs arrive continuously and because of insufficient job shop capacity, a selection has to be made among the offered jobs. The goal is to find an order acceptance policy, which is supported by a scheduling policy, that ma...

متن کامل

A Bayesian Approach to Multiagent Reinforcement Learning

A Bayesian Approach to Multiagent Reinforcement Learning and Coalition Formation under Uncertainty Georgios Chalkiadakis Doctor of Philosophy Graduate Department of Computer Science University of Toronto 2007 Sequential decision making under uncertainty is always a challenge for autonomous agents populating a multiagent environment, since their behaviour is inevitably influenced by the behaviou...

متن کامل

Reinforcement learning under circumstances beyond its control

Decision theory addresses the task of choosing an action; it provides robust decision-making criteria that support decision-making under conditions of uncertainty or risk. Decision theory has been applied to produce reinforcement learning algorithms that manage uncertainty in state-transitions. However, performance when there is uncertainty regarding the selection of future actions must also be...

متن کامل

Multicast Routing in Wireless Sensor Networks: A Distributed Reinforcement Learning Approach

Wireless Sensor Networks (WSNs) are consist of independent distributed sensors with storing, processing, sensing and communication capabilities to monitor physical or environmental conditions. There are number of challenges in WSNs because of limitation of battery power, communications, computation and storage space. In the recent years, computational intelligence approaches such as evolutionar...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006